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DESCRIPTION 
METHOD FOR ISOLATING SATELLITE SEQUENCES 
Technical Field 

The present invention relates to a method for isolating satellite sequences. 
Satellite sequences, which are useful as population genetic markers, can be used as 
markers for genetic linkage analyses, etc. Therefore, an efficient technique for 
isolating them is an important subject of research in the field of genome analysis. 

Background Art 

It has been known that eukaryotic genomes contain repetitive sequences 
consisting of similar nucleotide sequence repeats. The first discovered repetitive 
sequence had a repeat unit consisting of satellite DNA, which is a long sequence 
ranging from several hundred to several thousand bases, and is called a satellite 
sequence (Bioscience, 27: 790-796, 1977). Repetitive sequences consisting of 
shorter nucleotide sequence units were later identified. They have been named, 
depending on the size of their repeat unit, microsatellite sequences (Am. J. Hum. 
Genet., 4: 397-401, 1989) when they have a repeat unit of 2 to 5 bases (Nucleic 
Acids Res. 9: 5931-5947, 1981) or minisatellite sequences (Nature, 314: 67-73, 
1985) when they have a repeat unit of 10 to 64 bases (Nature, 295: 31-35, 1982). 
Microsatellite sequences are also called simple sequences (Nucleic Acids Res. 17: 
6463-6471, 1989) or short tandem repeats (Am. J. Hum. Genet. 49: 746-756), etc. 

Use of microsatellite sequences as markers was not reported when they 
were discovered. However, after their polymorphism was confirmed by 
polymerase chain reaction (PCR) (Am. J. Hum. Genet. 4: 397-401, 1989), they 
have attracted the attention of researchers for their use as markers in a variety of 
areas. Specifically, their application to pedigree or lineage discrimination and 
individual identification of humans, animals, and plants is known. Because 
microsatellite sequences are scattered throughout a genome and abound in variation, 
they are good genetic markers. Furthermore, a microsatellite DNA polymorphism 
contains many polymorphic gene loci and a large number of alleles per gene locus. 
In addition, based on PCR, microsatellite DNA polymorphism analysis is easy to 
perform. The amplified products are easily detected as single or double bands by 
electrophoresis, which simplifies determination of individual types and facilitates 
the data processing. Therefore, microsatellite DNA markers have become widely 



used as the most effective markers for population genetics (J. Fish. Biol., 47: 29-55, 
1995). 

For microsatellite DNA polymorphism analysis, first, a variety of 
microsatellite DNA should be isolated from individual species to be analyzed. In 
5 addition, the microsatellite DNA regions should be amplified by PCR to detect the 
microsatellite polymorphism. Appropriate primers are needed for PCR. This 
means that, to perform microsatellite DNA polymorphism analysis, efficient 
methods for isolating microsatellite DNA from the species and for designing PCR 
primers capable of amplifying the microsatellite region are required. 

10 The inventors have already reported a method that is expected to allow 

efficient isolation of microsatellites from poultry, for which microsatellite 
polymorphism analysis is not advanced (Jpn. Poult. Sci., 33: 292-299, 1996). This 
method is an improvement on a conventional method of microsatellite sequence 
isolation that consists of a series of manipulations: fragmentation of a genome with 

1 5 restriction enzymes, insertion of the fragments into a vector, extension reaction 
with (TG) n primers, and then cloning (Proc. Natl. Acad. Sci. USA 89: 3419-3423, 
1992). Namely, isolation of unknown microsatellite sequences was achieved by 
choosing a vector with high transformation efficiencies, by specifically digesting 
single-stranded DNA with mung bean nuclease, or by removing bacterial DNA and 

20 RNA with DNase I and RNase A. In chicken, it is considered difficult to efficiently 
isolate microsatellite sequences, because the number of microsatellite sequences is 
small. The above method could achieve, by calculation, six times more efficient 
isolation than the known method that was used in an attempt to isolate chicken 
microsatellite sequences (Poultry Science, 74: 1855-1874, 1995). 

25 However, even by this method, it was impossible to prevent problematic 

clones from contaminating the microsatellite DNA clones obtained. Namely, a 
high rate of duplicate clones was found among the clones obtained, suggesting bias 
in the constitution of clones as well as low efficiency. 

Microsatellite sequences are required to be isolated according to each 

30 species. There are few species for which the isolation of microsatellite sequences is 
in progress, and it is still necessary to isolate microsatellite sequences from many 
species. However, there are a number of species-specific problems, for example, 
low frequency of microsatellite sequences in the chicken genome. Therefore, it is 
useful to provide a novel technique for isolating microsatellite sequences, also 

35 because it will enable selection from a variety of approaches. 
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Disclosure of the Invention 

An objective of the present invention is to provide a method for isolating 
satellite sequences with high isolation efficiency. The present invention provides 
an isolation method more efficient than any known technique. 
5 The present inventors presumed that one of the major causes for generating 

duplicate clones during isolation of satellite sequences by known techniques might 
come from the fragmentation of the genomic DNA. In other words, the possibility 
that, when digested with restriction enzymes, the genomic library is biased due to 
the fact that sequence-specific cleavage cannot be excluded. 

10 Therefore, the inventors attempted to utilize a nucleotide 

sequence-independent cleavage method in which more arbitrary cleavage of the 
genomic DNA is expected. As a result, they found that cleavage of genomic DNA 
is possible with nucleotide sequence-independent DNA-digesting enzymes or by a 
physical action. However, since most of the physically cleaved genomic DNA 

1 5 fragments are not phosphorylated at their ends, they are not allowed to be 

efficiently incorporated into vectors using an enzyme. In addition, since physically 
cleaved fragments have an irregularly protruding strand at their cleavage site, they 
cannot be ligated. The inventors have overcome these problems by using several 
enzymatic treatments, and have established a method to obtain a more 

20 homogeneous genomic library. Furthermore, they have confirmed that the library 
thus obtained enables efficient isolation of satellite sequences, thereby completing 
the present invention. As used herein, the term "satellite sequences" indicates, 
unless otherwise mentioned, any repeat sequences which include microsatellite 
sequences and minisatellite sequences. Also as used herein, "microsatellite 

25 sequences" and "minisatellite sequences" indicate repeat sequences consisting of 
repeat units of 2 to 5 bp and 1 0 to 64 bp, respectively. Thus, the present invention 
relates to the following methods for isolating satellite sequences. 

(1) An isolation method for satellite sequences, wherein a genomic DNA is 
cleaved by a nucleotide sequence-independent method, the isolation method 

30 comprising: 

a) obtaining randomly cleaved fragments of the genomic DNA and 

b) selecting, from the fragments obtained in a), fragments comprising the 
satellite sequences. 

(2) The isolation method of (1), wherein the nucleotide sequence-independent 
35 method is a physical cleavage method or an enzymatic cleavage method. 



(3) The isolation method of (2), wherein the physical cleavage method is 
sonication. 

(4) The isolation method of (3), wherein the ends of the genomic DNA that have 
been fragmented by sonication are to be blunted. 

(5) The isolation method of (4), wherein the ends are to be blunted with DNA 
polymerase having single strand-specific endonuclease activity and 3'— > 5' 
exonuclease activity. 

(6) The isolation method of (2), wherein a nucleotide sequence-independent 
endonuclease is used in the enzymatic cleavage method. 

(7) The isolation method of (6), wherein the nucleotide sequence-independent 
endonuclease is DNase I. 

(8) The isolation method of (1), wherein the satellite sequences are microsatellite 
sequences. 

(9) Use of satellite sequences isolated by the isolation method of any one of (1) to 
(8) as DNA markers. 

In the present invention, it is critical to cleave a genomic DNA using a 
nucleotide sequence-independent cleavage method that gives random fragments. 
Cleavage of a genomic DNA using a nucleotide sequence-independent cleavage 
method refers to cleaving DNA without dependence on structural characteristics 
(i.e., nucleotide sequences) of the DNA. Therefore, neither chemical reactions in 
which DNA is cleaved in a nucleotide sequence-dependent manner nor digestion 
with restriction enzymes constitutes the cleavage method of the present invention. 
The genomic DNA to which the method of the present invention is to be applied can 
be prepared by known methods. Namely, cells are lysed enzymatically with 
protease, and are subjected to DNA extraction using an appropriate solvent. 

For cleavage of a genomic DNA using a nucleotide sequence-independent 
cleavage method, any of the above-described cleavage methods, for example, those 
based on physical actions and those based on the action of DNA-digesting enzymes 
that do not recognize the nucleotide sequence, can be used. Physical actions 
include sonication and agitation. On the other hand, as DNA-digesting enzymes 
that do not recognize the nucleotide sequence, DNase I and the like are known. Any 
of these methods can be used in the present invention. Above all, treatment by 
sonication is one of the most desirable manipulations with excellent reproducibility. 
By sonication, a large quantity of fragments with uniform lengths can be obtained 
in a short time. In contrast, partial digestion of a genomic DNA with DNase I 
requires strict setting of conditions for reproducibly obtaining fragments with 
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uniform lengths. In addition, this method often requires troublesome 
manipulations. 

As to the extent of cleavage of a genomic DNA, those skilled in the art can 
empirically set the conditions that will permit efficient production of fragments 
5 having the optimal size for the satellite sequence of interest. For example, if 

isolation of microsatellite sequences is intended, such conditions that give a library 
of fragments of 300 to 900 bp are selected. Considering the operation of ligating 
the cleaved fragments into a vector, it is desirable to use conditions under which the 
majority of fragments generated have a size of 300 to 500 bp or 500 to 800 bp. This 

10 is because, if fragments contain integral multiples of smaller fragments, such 

fragments are indistinguishable from the ligated products of more than one smaller 
fragment. For instance, when sonication at 20 kHz with an amplitude of 10 is 
performed on ice, fragments of 300 to 500 bp can be efficiently obtained if a 
sonication of 1 -minute duration is performed 1 to 5 times. Since a prolonged 

15 sonication may heat the sample and denature DNA, it is desirable to repeat the 
sonication with a shorter time duration. 

The genomic DNA cleaved by physical actions such as sonication tends to 
exhibit poor ligation efficiency, which prevents efficient incorporation of the DNA 
into vectors. For this reason, it is desirable to blunt the ends of the DNA fragments. 

20 In the present invention, any of the known blunting methods can be used: removal 
of the protruding ends with single strand-specific endonuclease, and further 
synthesis of the complementary strand at the 3' recessed ends with DNA 
polymerase to ensure the blunting. Specifically, mung bean nuclease and S 1 
nuclease can be used as the single stranctspecific endonucl eases. On the other hand, 

25 DNA polymerase having 3'— > 5' exonuclease activity can be used as the DNA 

polymerase. Such DNA polymerases include T4 DNA polymerase, pfu polymerase, 
and KOD polymerase. Mung bean nuclease, which is used in the Example, is a 
single strand-specific endonuclease and achieves blunting by digesting the 
protruding portion of the single strands of the DNA. In the Example, the action of 

30 T4 DNA polymerase is further utilized to ensure the blunting. T4 DNA polymerase 
digests the protrusion at the 3' ends by its strong 3'— > 5' exonuclease activity, and 
at the same time, it can synthesize complementary strands at the 3' recessed ends, 
thereby giving blunt ends. However, T4 DNA polymerase cannot blunt a recessed 
3 ' end with a phosphate group. Therefore, it is desirable to use mung bean nuclease 

35 and T4 DNA polymerase together. In such a case, treatment with mung bean 

nuclease cleaves gaps generated in the double-stranded DNA by sonication, as well 
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as blunts the ends. As a result, fragments with lengths shorter than expected may be 
produced. In order to obtain a library free from shorter fragments, it is desirable to 
provide a step of isolating the fragments of expected size after treating with mung 
bean nuclease. Fragments of expected sizes can be isolated by agarose gel 
5 electrophoresis or by gel filtration. The blunt-ended genomic DNA fragments can 
be directly ligated into vectors. Alternatively, they can be phosphorylated at their 
5' ends to improve the efficiency of ligation. T4 polynucleotide kinase can be used 
for phosphorylation. 

The genomic DNA fragments with phosphorylated 5' ends can self-ligate 

1 0 during the ligation into the vector. To prevent the self-ligation, the concentration of 
the fragments should be reduced, while excess concentration of the vector is 
employed so as to increase the opportunities for the fragments to be ligated into the 
vector. For the ligation reaction, T4 DNA ligase works in the ligation buffer and 
ligation efficiency can be further improved by carrying out the ligation in the 

15 presence of the restriction enzyme used to cleave the vector. For example, when 
pCR-ScriptSK(-l-) is used as the vector, restriction enzyme Srfl can help keep the 
vector in a linear state. Since Srfl does not act on the vector into which an insert is 
ligated, one can expect the ligation efficiency to be improved. 

In the present invention, not only physical actions but also nucleotide 

20 sequence-independent DNA-digesting enzymes can be used for fragmentation of 
DNA. These enzymes include, for example, DNase I, etc. For enzymatic treatment 
with DNase I, conditions that allow uniform action of the enzyme on the whole 
genome and that also allow production of fragments with expected sizes should be 
empirically provided. For example, in order to maintain the enzymatic action in a 

25 uniform state, genomic DNA with highest purity should be used after nuclear 

proteins are removed. Also, the enzymatic reaction should be performed as quickly 
as possible in order to avoid producing a large number of small fragments. In 
addition, conditions such as reaction time and temperature should be accurately 
controlled in order to maintain high reproducibility. After the enzymatic reaction is 

30 completed, the reaction is stopped by heating or other means. The nucleic acid 
component is recovered from the reaction, and, if necessary, particular fragments 
are extracted, followed by incorporation of the fragments into the vectors as 
described above. The fragments of the genomic DNA digested with the enzyme are 
blunt-ended and phosphorylated, which allows them to be directly used for ligation. 

35 The vectors used in the present invention are not specially restricted. 

Specifically, known vectors such as pCR-ScriptSK(+), pBluescriptKS(+) (both are 
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manufactured by Stratagene), and pUCl 8 can be used. After an appropriate host is 
transformed with the vector including the insert and is grown, the DNA is recovered 
to make a genomic library. When pCR-ScriptSK(+) is used for transformation, it 
may be advantageous to use host strains having excellent transformation 
5 efficiencies, such as E. coli (competent cell) XLl-Blue MRF\ XL2-Blue MRF', 
and TGI (all are manufactured by Stratagene). 

Using the genomic library thus constructed, transformants containing the 
satellite sequences of interest can be cloned. Screening for the satellite sequences 
can be performed based on colony hybridization using a probe for a satellite 

10 sequence (Can. J. Fish. Biol. 51 : 1959-1996, 1994) or on primer extension (Proc. 
Natl. Acad. Sci. USA, 89: 3419-3423, 1992). In colony hybridization, colonies of 
the transformed cells as described above are transferred onto a filter, and hybridized 
with a probe having, for example, a repeat sequence of (GT) n . The nucleotide 
sequence of the oligonucleotide that constitutes the probe is properly provided 

1 5 based on the satellite sequence of interest. Positive colonies are separated to isolate 
the clones containing the satellite sequence. 

In primer extension, on the other hand, the genomic library is recovered as 
single-stranded DNA. For example, if pCR-ScriptSK(+) is used as the vector, the 
transformed E. coli cells can be infected with helper phage VCS-M13 or the like to 

20 recover the inserts in a single-stranded form as recombinant phages. Since DNA 
packaged in the phage is protected against enzymatic action, RNA and DNA 
derived from E. coli can be enzymatically degraded when the DNA of interest is 
recovered, thereby achieving more efficient cloning. 

To the single-stranded plasmid DNA isolated from the phage, a primer 

25 specific to the satellite sequence is annealed, and then, DNA polymerase is applied. 
For example, repetitive sequences of (dA-dT) n are generally (0.3% of the entire 
genomic DNA) found in the mammalian microsatellite sequences. In humans, such 
combinations as (dC-dA/dT-dG) n and (dC-dT/dA-dG) n are also frequently 
observed. Based on such information, one can design primers that comprise a 

30 nucleotide sequence complementary to a satellite sequence and that anneal to the 
satellite sequence. It is advantageous to phosphorylate the 5' end of the primer 
beforehand. Vectors that contain a microsatellite sequence acquire the primer at 
this site and a double-stranded DNA is synthesized, while those to which the primer 
does not anneal remain single-stranded. Ligation is performed after the synthesis of 

35 the complementary strand with DNA polymerase to complete the double-stranded 
circular DNA, and then the single-stranded DNA is digested with mung bean 
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nuclease or the like. Consequently, only the vectors containing the satellite 
sequences remain. These vectors can be recovered and used to transform cells, 
thereby cloning the vectors. After amplifying the cloned vectors containing the 
satellite sequences, one can recover the DNA and determine the sequences, thereby 
5 completing the isolation of the satellite sequences. 

When a satellite sequence is isolated together with its flanking regions, 
primers for PCR can be designed. For designing the primers, it is convenient to 
utilize a commercially available package of software for sequence analysis. For 
example, in the Example below, a primer designing software Primer Premier 

10 (Premier Biosoft International) is used. The microsatellite sequences and 

minisatellite sequences isolated are useful as fingerprint markers for intraspecies 
analysis or as pedigree (or lineage) markers. If any satellite sequence obtained in 
the present invention exhibits a polymorphism, such a nucleotide sequence can be 
used as an indicator for identification of individuals. Such a satellite sequence is 

15 called a fingerprint marker. Since satellite sequences used as fingerprint markers 
are inherited from the parents of an individual, analysis of the fingerprint markers 
conserved among multiple individuals indicates the probability that these 
individuals belong to a single pedigree. Such use of a satellite sequence is called a 
pedigree (or lineage) marker. 

20 Furthermore, the satellite sequences isolated in the present invention can 

also be used as genetic linkage markers, in addition to the use as fingerprint markers 
and pedigree (or lineage) markers. In the present invention, a satellite marker 
utilized as an indicator represented by the uses exemplified above is called a DNA 
marker. 

25 

Brief Description of the Drawings 

Figure 1 shows the coloring image of a dot-blot assay with a microsatellite 
sequence-specific probe ((CA)io oligonucleotide). 

30 Best Mode for Carrying out the Invention 

The present invention is described more specifically in the following 
examples. 

(1) Preparation of DNA fragments from black abalone 
35 Black abalone (Haliotis discus discus) DNA was isolated from foot muscle 

according to the TNES-Urea method (Fisheries Sci., 62: 723^726, 1996). However, 
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4 M urea was used herein. The genomic DNA solution (20 ng/ml, 550 jil) was 
sonicated (20 kHz, Amplitude 10 3 for one minute, five times) with cooling to 
fragment the DNA. A reaction mixture (52 jil) containing the substrate DNA 
(about 10 jag), 30 mM CH 3 COONa (pH 4.6), 50 mM NaCl, 1 mM (CH 3 COO) 2 Zn, 
5 5% glycerol, and 60 units of mung bean nuclease (Toyobo) was incubated for 1 
hour at 37°C to cleave the gap portions of the DNA fragments and to blunt the 
protruding ends. By electrophoresis on a 1 .2% agarose gel, DNA fragments of 300 
to 500 bp were recovered. A reaction mixture (100 jal) containing the recovered 
substrate DNA, 20 (iM dNTP, 50 mM Tris/HCl (pH 8.5), 7 mM MgCl 2 , 15 mM 

10 (NH 4 ) 2 S04, 10 mM 2-mercaptoethanol, 0.1 mM EDTA, and 10 units of T4 DNA 
polymerase (Toyobo) was incubated for 1 hour at 37°C to blunt the protruding ends 
of the DNA fragments. Further, a reaction mixture (50 |al) containing the substrate 
DNA, 0.2 mM rATP, 50 mM Tris/HCl (pH 7.6), 10 mM MgCl 2 , 10 mM 
2-mercaptoethanol, and 10 units of T4 polynucleotide kinase (Toyobo) was 

1 5 incubated for 1 hour at 37°C to phosphorylate the 5' ends of the DNA fragments. 

(2) Ligation of genomic DNA fragments into plasmid vectors 

A reaction mixture (10 jal) containing pCR-Script SK(+) vector (Stratagene) 

(about 1 jug), 25 mM Tris/ Acetate (pH 7.6), 100 mM KOAc, 10 mM MgOAc, 0.5 
20 mM 2-mercaptoethanol, 10 jag/ml BSA, and 10 units of restriction enzyme Srfl 

(Stratagene) was incubated for 1 hour or more at 37°C, to cleave the vector at the 
. Srfl site. A reaction mixture (50 jal) containing the vector solution prepared, the 

substrate DNA (about 3 jag), 10 units of Srfl, 0.5 mM rATP, 66 mM Tris/HCl (pH 

7.6), 6.6 mM MgCl 2 , 10 mM dithiothreitol, and 20 units of T4 DNA ligase 
25 (Toyobo) was incubated overnight at 24°C to ligate the DNA fragments into the 

vector. After inactivating the DNA ligase by heating at 65°C for 15 minutes, 10 

units of Srfl was further added. The reaction mixture was incubated for 1 hour at 

37°C to digest the self-ligated vector. 

30 (3) Preparation of single-stranded DNA 

To each of five culturing tubes, 100 jal of XL2-Blue MRF' ultracompetent 
cells (Epicurian coli ultracompetent cells; Stratagene) was added, and then, the 
ligated vector (about 200 ng) was added, followed by transformation according to 
the attached instruction manual. After 900 |al of NZY medium (Maniatis, 

35 Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory 
Press, New York, 1989) was added to each of the tubes, the cells were cultured for 

\ 
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one hour at 37°C with shaking. Subsequently, ampicillin was added to a final 
concentration of 50 jag/ml. The cells were cultured for another hour at 37°C with 
shaking to select the transformed E. coli. To each of the tubes, about 10 10 pfu of 
VCS-M13 helper phage was added. The cells were allowed to stand for 20 minutes 
5 at 37°C, followed by addition of kanamycin at a final concentration of 70 jag/ml. 
The cells were cultured for 1 hour at 37°C with shaking to select the E. coli infected 
with the helper phage. All the cultures were put together and added to 100 ml of 
Terrific medium (Maniatis, Molecular Cloning: A Laboratory Manual, 2nd ed., 
Cold Spring Harbor Laboratory Press, New York, 1989) containing ampicillin and 

10 kanamycin. The cells were cultured for 14 hours at 37°C with shaking. The 
Terrific medium was centrifuged at 14,000 rpm for 10 minutes at 0°C, and the 
supernatant was recovered and filtered twice through a 45 jam filter. After 
precipitating twice with PEG according to the standard method (Muramatsu, Lab 
Manual of Genetic Engineering, 3rd ed. s Maruzen, Tokyo, 1996, pp.5 1-55), the 

15 DNA and RNA derived from E. coli were digested according to a known method 
(Jpn. Poult. Sci. 33: 292-299, 1996). Subsequently, PEG precipitation, phenol 
extraction, and ethanol precipitation were performed to purify the single-stranded 
DNA. 

20 (4) Selection of (CA) n -positive plasmids 

Primer extension was performed using a (CA)i 2 oligonucleotide to select the 
plasmid DNA into which DNA fragments with (TG/CA) repeats had been inserted. 
The solution (98 jal) containing the single-stranded DNA (about 3 jig), 0.2 raM 
dNTP, 20 mM Tris/HCl (pH 8.8), 10 mM KC1, 10 mM (NH^SC^, 2 mM MgS0 4 , 

25 100 ng/ml BSA, 0.1% Triton X-100, and 100 pmol (CA)i 2 oligonucleotide was 

pre-heated at 72°C for 10 minutes. Five units of Pfu DNA polymerase (Stratagene) 
was added to the solution, which was then overlaid with mineral oil and incubated 
for 30 minutes at 72°C. Subsequently, the generated double-stranded DNA was 
recovered by phenol extraction and ethanol precipitation and was dissolved in 10 jliI 

30 of sterilized water. After circularizing the double-stranded DNA by using a ligation 
kit (Ligation high, Toyobo), the DNA ligase was inactivated by heat treatment 
(65°C for 15 minutes). A reaction mixture (100 ^1) containing this solution, 30 mM 
CH 3 COONa (pH 4.6), 50 mM NaCl, 1 mM (CH 3 COO) 2 Zn, 5% glycerol, and 30 
units of mung bean nuclease (Toyobo) was incubated for 2 hours or more at 37°C to 

35 digest the single-stranded DNA which had not been primer-extended. DNA was 
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recovered by phenol/chloroform extraction and ethanol precipitation, and was 
dissolved in 60 jal of TE buffer. 

With the recovered DNA (equivalent to 100 |ug of the single-stranded 
DNA), XL2-Blue MRF' Ultracompetent cells were transformed. The transformed 
5 cells were spread onto a 2xYT agar medium containing 50 jxg/ml ampicillin 

(Maniatis, Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor 
Laboratory Press, New York, 1989), and cultured overnight at 37°C. Single 
colonies on the culture plate were picked up at random, and cultured overnight at 
37°C with shaking in 2 ml of 2xYT medium (Maniatis, Molecular Cloning: A 

10 Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, New York, 

1989) containing ampicillin at a final concentration of 50 jag/ml. The plasmid was 
extracted by the alkaline method according to the standard procedure (Muramatsu, 
Lab Manual of Genetic Engineering, 3rd ed., Maruzen, Tokyo, 1996, pp.5 1-55), 
and dissolved in 50 jal of TE buffer. 

1 5 The purified plasmid DNA (equivalent to 4 ng of the single-stranded 

DNA) was alkaline-denatured with 0.5 N NaOH and blotted onto a nylon 
membrane (No. 1209299, Boehringer Mannheim). The membrane was baked at 
120°C for 30 minutes. Using (CA)i 0 oligonucleotide digoxigenin-labeled with the 
DIG oligonucleotide labeling kit (DIG Oligonucleotide Tailing Kit, Boehringer 

20 Mannheim) as a probe, and using the DIG nucleic acid detection kit (DIG Nucleic 
Acid Detection Kit, Boehringer Mannheim), (CA)n -positive plasmids were detected 
according to the manufacturer's instruction. Part of the results is shown in Figure 1 . 

(5) Cycle sequencing 

25 For the (CA) n -positive clones, nucleotide sequences were determined using 

the cycle sequencing method. A KS primer labeled with Cy5 at its 5' end, a 
Reverse primer, and a sequencing kit (ThermoSequence fluorescent labeled primer 
cycle sequencing kit with 7-deaza-dGPT; Amersham) were used for the sequencing 
reaction. The fluorescent DNA sequencer (ALFexpress, Pharmacia) was used for 

30 sequencing. Based on the nucleotide sequence data thus obtained, optimal primers 
were designed for the region encompassing the CA repeats using software for 
primer design (Primer Premier; Premier Biosoft International). The nucleotide 
sequences of the microsatellite sequences isolated from the black abalone are 
shown below. Repetitive units are shown in the parentheses so that they are easily 

35 recognized. The actual sequences are as shown in the sequence listing (the numbers 
correspond to the SEQ ID NOs). For the regions 1 to 24, PCR primers can be 
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designed, and a polymorphism was identified for nine regions (1, 3, 8, 10, 13, 15, 16, 
19, and 24). 

1. (CA) 4I 

2. (GACT) 2 (CTCA) 7 (CA) 2 CT(CA)9 
5 3. C 5 CAC 2 (CA) 12 TA(CA) 8 

4. (CA) 7 

5. (CA) 16 

6. (GA) 2 CAGA(CA) 5 

7. CA 3 GA 2 C 3 A 3 (CA) 5 

10 8. (CGCA) 9 TGCAC 2 (CA) 2 

9. (CT) 3 (CA 2 ) 3 (CA) 5 

10. (CA) 25 

11. CACT(CA)i 6 TACA 

12. CA 3 (CA) 2 T(CA) 4 
15 13. (CA) 30 

14. CA 2 GCA 2 C(CA) 25 

15. (CA) 2 CT(CA) 13 (CGCA),i(CA) 6 

16. (CA) 8 (CG) 4 

17. (CA) 6 (CG) 4 
20 18. (CA) 5 

19. (CA) 26 

20. TACATA(CA), 2 

21. (CA) 2 CA 3 (CA) 6 

22. (CA) 2 AC(CA) 3 AC(CAC) 2 (CA) 5 
25 23. (CA) 8 (TGCA) 2 

24. (CA) 34 

25 . (C A) 7 (CGC A) 2 CG A 2 (CGC A>2 A 2 (C A) 2 (CG) 2 

26. (CA) 8 

27. CAC 8 (CA) 9 C 4 
30 28. (CA) 6 (GA) 2 

29. (CA) 26 

30. (CA) 25 

31. CAG(CA) 5 TACA 

32. (CA 3 ) 3 (CA) 4 CA 3 (CA)i 2 G 2 CA(CG 2 ) 3 

35 About 85% (41 clones) of the (TG/CA) n -concentrated library constructed 

based on the sequence-independent cleavage method using sonication was found to 
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be (CA) n -positive clones (Figure 1). Out of them, 32 (CA) n -positive clones were 
selected at random and sequenced. As a result, PCR primers were designed for 24 
clones (72%). For the rest of the clones (9 clones), it was impossible to design the 
primers because some repeat regions were adjacent to the ligation site on the vector, 
5 because some regions were ligated into the vector in the middle of the repeat, or 
because some repeats were scattered throughout the DNA fragment. 

Industrial Applicability 

The present invention makes it possible to efficiently isolate satellite 

10 sequences such as microsatellite sequences. Since the present invention utilizes a 
sequence-independent cleavage method for fragmentation of a genomic DNA, it is 
not influenced by the nucleotide sequence, and enables the production of unbiased 
libraries. Satellite sequences isolated from the unbiased library contain reduced 
redundancy. Thus, the present invention enables efficient isolation of satellite 

15 sequences. 

For instance, the present inventors have succeeded in obtaining a 
(TG/CA) n -concentrated library comprising 32 clones by a single manipulation from 
black abalone, a source from which no microsatellite sequences have been isolated 
previously. The clones obtained from this library are free from redundancy. It can 

20 be concluded that, compared with the previous method that produced many 

duplicated clones for chicken, the present method has increased the efficiency of 
the isolation. The isolated clones can serve for designing PCR primers as well as 
show reduced sequence redundancy, which is another striking effect of the present 
invention. For instance, calculated from the result of the example, at least 60% 

25 (0.85 x 24/32) of the clones cultured on the plate can be used to design PCR primers. 
These results reveal that the methods of the present invention for isolating 
microsatellite sequences are excellent methods that can be widely applied. 

Furthermore, the satellite sequences isolated by the methods of the present 
invention can be utilized as DNA markers, such as fingerprint markers, pedigree (or 

30 lineage) markers, and genetic linkage markers. 



